3574 results found.
Written
Corpus,
Language Type:
Bilingual
Languages:
English Japanese
Availability:
From Data Center(s)
License:
Size:
3000 essays OtherProduction Status:
Newly created-in progress
Use:
Natural Language Generation
-
Paper title:Creating Corpora for Research in Feedback Comment Generation
-
Paper track:Written/poster presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Ryo Nagata | ICNALE Learner Essays with Feedback Comments | /N |
Documentation:
None
Written
Question Answering Dataset,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
CC BY-SA 4.0
Size:
45 MByte Production Status:
Existing-used
Use:
Question Answering
-
Paper title:Outbound Translation User Interface Ptakopět: A Pilot Study
-
Paper track:Written/poster presentation
-
Paper status:Accept Poster+Demo
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Vilém Zouhar | SQuAD 2.0 | /N |
Documentation:
None
Written
Evaluation Data,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Creative Commons BY-NC-SA 3.0
Size:
691 sentences Production Status:
Newly created-finished
Use:
communicative function identification
-
Paper title:An Evaluation Dataset for Identifying Communicative Functions of Sentences in English Scholarly Papers
-
Paper track:Evaluation/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Kenichi Iwatsuki | An Evaluation Dataset for Identifying Communicative Functions of Sentences in English Scholarly Papers | /N |
Documentation:
https://github.com/Alab-NII/FECFevalDataset/blob/master/README.md
Written
Corpus,
Language Type:
Bilingual
Languages:
Chinese English
Availability:
Freely Available
License:
Size:
650 MByte Production Status:
Newly created-finished
Use:
Named Entity Recognition
-
Paper title:A Chinese Corpus for Fine-grained Entity Typing
-
Paper track:Evaluation/poster presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Chin Lee | A Chinese Corpus for Fine-grained Entity Typing | /N |
Documentation:
None
Written
Evaluation Data,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
Size:
1,000,000 documents entries Production Status:
Newly created-finished
Use:
Document Classification, Text categorisation
-
Paper title:Discovering Biased News Articles Leveraging Multiple Human Annotations
-
Paper track:Evaluation/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Konstantina Lazaridou | Semeval 2019 - Hyperpartisanship detection | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
Size:
1,000 entries Production Status:
Newly created-finished
Use:
Document Classification, Text categorisation
-
Paper title:Discovering Biased News Articles Leveraging Multiple Human Annotations
-
Paper track:Evaluation/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Konstantina Lazaridou | Expert data for media bias detection | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
Size:
2,000 documents entries Production Status:
Newly created-finished
Use:
Document Classification, Text categorisation
-
Paper title:Discovering Biased News Articles Leveraging Multiple Human Annotations
-
Paper track:Evaluation/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Konstantina Lazaridou | Crowd sourced data for media bias detection - Mechanical Turk | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
Size:
1,000 documents entries Production Status:
Existing-updated
Use:
Document Classification, Text categorisation
-
Paper title:Discovering Biased News Articles Leveraging Multiple Human Annotations
-
Paper track:Evaluation/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Konstantina Lazaridou | Crowd sourced data for media bias detection - Figure Eight | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Creative Commons Attribution 4.0 International (CC BY 4.0)
Size:
400 MByte Production Status:
Newly created-finished
Use:
Machine Learning
-
Paper title:RedDust: a Large Reusable Dataset of Reddit User Traits
-
Paper track:Evaluation/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Anna Tigunova | RedDust: a Large Reusable Dataset of Reddit User Traits | /N |
Documentation:
Documentation in English
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
49500 entries Production Status:
Newly created-in progress
Use:
Opinion Mining/Sentiment Analysis
-
Paper title:A Large Scale Speech Sentiment Corpus
-
Paper track:Multimodality/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Eric Chen | SwitchBoard Sentiment | /N |
Documentation:
None




